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5 Related Applications 

The present application is a continuation of U.S. Patent Application Serial No. 
09/535,262, filed March 23, 2000, and claims priority to U.S. Provisional Application 
Serial No. 60/125,953, filed March 24, 1999, both of which are hereby expressly 
incorporated by reference in their entireties. 
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Background of the Invention 

Field of the Invention 

The present invention relates to a method for generating transcriptionally 
active DNA fragments. More specifically, the method relates to synthesis of a DNA 
15 fragment by polymerase chain reaction (PGR) using nested primers, promoter 

sequences and terminator sequences. 



Description of the Related Art 

In addition to the tremendous progress made in the past few years in 

20 sequencing the human genome, efforts have also been made to sequence other 

organisms that are of biomedical importance. For example, complete genomic 
sequences have been obtained for Borrelia burgdorferi (cause of Lyme disease), 
Chlamydia, Heliobacter pylori and Mycobacterium tuberculosis. The fast-growing 
sequence information provides immense opportunities to reveal the basic biology of 

25 related organisms at the gene/molecular level and to develop novel therapeutics or 
vaccines against various pathogens. 

However, this vast sequence information also mandates a much more efficient 
and streamlined way to screen and identify genes of interest from tens of thousands of 
candidate genes. The conventional approach to gene screening and identification 

30 involves generation of a cDNA library, subcloning the DNA inserts into plasmid 



vectors (expression vectors), purifying plasmid DNA from bacteria for each individual 
cDNA clone and transfecting animal cells or tissues for functional analysis of the 
encoded gene product. This method, even in conjvmction with the use of polymerase 
chain reaction (PGR) to generate cDNA fragments to allow directional and in-frame 
5 cloning, is still time consuming, costly and difficult to automate. 

The present invention provides a simple, rapid method for the generation of 
transcriptionally active DNA fragments. 

Sunmxarv of the Invention 

10 One embodiment of the present invention is a method for generating a 

transcriptionally active DNA molecule, comprising polymerase chain reaction (PGR) 
amplification of said DNA molecule in the presence of a first DNA fragment (Fl), 
second DNA fragment (F2), first primer (PI), a second primer (P2), a third primer 
(P3) and a fourth primer (P4) wherein: Fl comprises a promoter sequence; F2 

15 comprises a terminator sequence; PI is complementary to the 5' end of Fl; P2 is 

complementary to the 3' end of F2; P3 comprises a first region complementary to the 
3' end of Fl and a second region complementary to the 5' end of said DNA molecule; 
P4 comprises a first region complementary to the 5' end of F2 and a second region 
complementary to the 3* end of said DNA molecule, whereby a transcriptionally 

20 active DNA molecule is produced by said PGR ampUfication. Preferably, Fl is the 

cytomegalovirus IE promoter. In one aspect of this preferred embodiment, the 
transcriptionally active DNA molecule encodes a therapeutic gene. The method may 
further comprise the step of adding a PNA tail to the 5'-end of PI and P2 prior to the 
PGR amplification. Preferably, a thymidine base immediately precedes the region of 

25 complementarity between the third primer P3 and the first DNA fragment Fl. In 

another aspect of this preferred embodiment, a thymidine base immediately precedes 
the region of complementarity between the fourth primer P3 and the second DNA 
fragment F2. The method may also further comprise the step of adding a PNA clamp 
to said transcriptionally active DNA molecule after said PGR amplification. 
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Preferably, the method further comprises the step of adding a PNA molecule via a 
linker (PNA clamp tail) to primers PI and P2 prior to the PGR amplification. 

Brief Description of the Drawings 
Figiure 1 is a schematic diagram of the gene amplification method of the 

5 present invention. Oligonucleotide primers PI and P2 are complementary to the 5' 

end of a promoter sequence (Fl) and the 3' end of a terminator sequence (F2), 
respectively. P3 and P4 are oligonucleotide primers which contain regions 
complementary to opposite ends of Gene X. In addition, P3 and P4 contain regions 
which are complementary to the 3' region of Fl and the 5' region of F2. An excess 

10 amount of primers PI and P2 are combined with Gene X, or with a mixture of DNA 

containing Gene X, and with Fl, F2, P3 and P4, or with a mixture of DNA containing 
Fl, F2 and Gene X which has been PGR amplified using P3 and P4, and subjected to 
PGR to produce a transcriptionally active linear DNA firagment containing Fl, F2 and 
gene X. 

15 Figure 2 is a schematic diagram showing DNA firagment Fl which contains a 

thymidine base immediately preceding the region of complementarity with the PGR 
intermediate (primer P3). 

Figure 3 is a schematic diagram showing the use of peptide nucleic acid (PNA) 
sequences and PNA clamps to protect the ends of linear expression DNA firagments. 

20 Primers PI and P2 contain a PNA tail which is resistant to proteolytic and 
exonuclease degradation. The resuhing linear expression DNA firagment contains a 
PNA tail at each of the S'-ends. The 3' ends are protected by addition of a PNA 
"clamp" which is also resistant to proteolytic and exonuclease degradation. 

Figure 4 is a schematic diagram showing the use of a PNA "clamp" tail to 

25 protect the 5' ends of the linear expression DNA firagment, followed by formation of a 

clamp during PGR to protect the 3' ends. 

Detailed Description of the Preferred Embodiments 
The present invention provides a simple, efficient method for generating 
30 transcriptionally active DNA fi-agments which can be readily transfected into animal 
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cells or tissues by conventional nucleic acid transfection techniques, without the need 
for subcloning into expression vectors and purification of plasmid DNA from bacteria. 
The transcriptionally active DNA fragments are synthesized by polymerase chain 
reaction (PGR) ampHfication of any gene of interest using nested oligonucleotide 
5 primers and two DNA fragments, one of which comprises an active transcription 
promoter sequence and one of which comprises a basic transcription terminator 
element. A first primer is complementary to the DNA fragment comprising the 
promoter; a second primer is complementary to the DNA fragment comprising the 
terminator; a third primer is complementary to both the promoter sequence and one 
10 end of the gene of interest; and a fourth primer is complementary to both the 

terminator sequence and the other end of the gene of interest. These promoters, genes 
and terminators are linked in an expression cassette as shown in Figure 1. In addition, 
the ends of linear DNA containing the expression cassette can be protected from 
exonuclease digestion during and after transfection by incorporating peptide nucleic 
1 5 acid (PNA) sequences and PNA clamps as shown in Figures 3 and 4. 

As used herein, the term "promoter" is a DNA sequence which extends 
upstream from the transcription initiation site and is involved in binding of RNA 
polymerase. The promoter may contain several short (<10 base pair) sequence 
elements that bind transcription factors, generally dispersed over >200 base pairs. A 
20 promoter that contains only elements recognized by general and upstream factors is 

usually transcribed in any cell type. Such promoters may be responsible for 
expression of cellular genes that are constitutively expressed (sometimes called 
housekeeping genes). There are also tissue-specific promoters limited to particular 
cell types, such as the human metallothionein (MT) promoter which is upregulated by 
25 heavy metal ions and glucocorticoids. 

As used herein, the term "terminator" is a DNA sequence represented at the 
end of the transcript that causes RNA polymerase to terminate transcription. This 
occurs at a discrete site downstream of the mature 3' end which is generated by 
cleavage and polyadenylation. 
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The present method can be used to quickly generate nuclease-resistant and 
transcriptionally active linear DNA molecules which express any desired gene with 
any known sequence. The linear DNA can then be deUvered into animal cells or 
tissues for functional analysis, vaccination and other pharmaceutical applications. 
5 This method also avoids problems associated with bacterial growth such as toxicity 

and stabiUty. This method can also be completely automated for use in high- 
throughput screening methods. 

Referring now to Figure 1, oligonucleotide primers PI and P2 are 
complementary to the 5' region of DNA fragments Fl and F2, respectively. Fragment 

10 Fl comprises a transcription promoter (darkened region) and fragment F2 comprises a 
transcription terminator (darkened region). Primers P3 and P4 are complementary to 
the 3' ends of Fl and F2, respectively, and to different ends of the DNA fragment 
containing Gene X. Figure 1 shows the putative intermediates and the final product 
which are formed in the present PCR-based method when PI, P2, P3, P4 , Fl and F2 

15 are added to a DNA template comprising Gene X. It should be noted that amount of 

primers PI and P2 present in the reaction mixture are preferably at least about 100 
fold greater than the amounts of P3, P4, Fl and F2, to minimized the amounts of 
intermediate products generated during the reaction. To generate the first 
intermediate, primer P3 and P4 are used which amplify from the promoter and 

20 terminator sequences of Fl and F2 across Gene X, resulting in an intermediate having 

the promoter and terminator sequences on the ends and Gene X in between these 
sequences. 

The second intermediate is formed by hybridization of the first intermediate 
with fragments Fl and F2 via their complementary promoter and terminator sequences 
25 (darkened regions), followed by PGR amplification from primers P3 and P4, resulting 

in a DNA segment comprising the entire Fl, F2 and Gene X. 

In the last step of the reaction, PGR amplification of the second intermediate 
using primers PI and P2 results in ampHfied amounts of the complete transcriptionally 
active DNA fragment. 



-5- 



The method of the invention may be perfomied by adding all of the 
components in a single reaction mixture. Altematively, two separate PGR reactions 
can be performed. In the first reaction, the template gene, P3 and P4 are used first. 
The product of this reaction is then used as a template for a second PGR reaction 

5 involving fragments Fl and F2, plus primers PI and P2. 

The generation of final products using the nested PGR method describe above 
is dependent on the sequences of the Fl or F2 fragments at the junction between the 
region overlapping P3 and P4, respectively. This is due, at least in part, to the 
addition of an extra adenosine base (A) at the 3' end of the PGR fragment by Taq 

10 DNA polymerase which is commonly used in PGR protocols. This could produce a 

mismatch between the PGR intermediate generated by P3/P4 and fragments Fl and 
F2. 

Thus, in a preferred embodiment, the overlap between the PGR intermediate 
(P3 primer) and Fl, and P4/F2, is designed such that a thymidine base (T) 

15 immediately precedes the overlap region in fragments Fl and F2 (Fig. 2, only Fl 
shown). In this case, even with the addition of an "A" base to the 3' end to the 
intermediate during PGR amplification, the overlap between F1/F2 and the respective 
PGR intermediates is still perfectly maintained. 

Peptide nucleic acids (PNA) are nucleic acid analogs in which the entire 

20 deoxyribose-phosphate backbone has been exchanged with a chemically completely 
different, but structurally homologous, polyamide (peptide) backbone containing 2- 
aminoethyl glycine units. Unlike DNA, which is highly negatively charged, the PNA 
backbone is neutral. Therefore, there is much less repulsive energy between 
complementary strands in a PNA-DNA hybrid than the comparable DNA-DNA 

25 hybrid, and consequently they are much more stable. 

In addition, molecules called PNA "clamps" have been synthesized which 
have two identical PNA sequences joined by a flexible hairpin Unker containing three 
8-amino-3,6-dioxaoctanoic acid units. When a PNA clamp is mixed with a 
complementary homopurine or homopyrimidine DNA target sequence, a PNA-DNA- 

30 PNA triplex hybrid can form which is extremely stable (Dentin et al. Biochemistry 



35:8863-8869, 1996; Egholm et al., Nucleic Acids Res, 23:217-222, 1995; Griffith et 
al., J. Am, Chem, Soc, 117:831-832, 1995). The sequence-specific and high affinity 
duplex and triplex binding of PNA have been extensively described (Nielsen et al., 
Science 254:1497-1500, 1991; Egholm et al, J, Am, Chem, Soc, 114:9677-9678, 
5 1992; Egholm et al, Nature 365:566-568, 1993; Almarsson et al., Proc. Natl, Acad, 

Sci, U, S A, 90:9542-9546, 1993; Demidov et al, Proc. Natl, Acad Sci. U. S A. 
92:2637-2641, 1995). They have also been shown to be resistant to nuclease and 
protease digestion (Demidov et al., Biochem, Pharm, 48:1010-1013, 1994). 

A representative PNA clamp is shown in Figs. 3 and 4. This PNA clamp, or a 

10 PNA clamp having any desired DNA sequence, is synthesized as described by 
Egholm et al. (supra,) or is available from PerSeptive Biosystems (Framingham, MA). 
The representative PNA clamp shown in Figs. 2 and 3 has the sequence 5'- 
TCTCTCTC-O-O-O-TJTJTJTJ (SEQ ID NO: 1), where J (pseudoisocytosine) is an 
analog of C, and the "O" residues are 8-amino-3,6-dioxaoctanoic acid linkers which 

15 separate the two regions of the PNA. 

The use of PNA tails, PNA clamps and PNA "clamp tails" to protect the ends 
of the transcriptionally active PCR fragments which are described above is illustrated 
in Figure 2. The ends of the PCR fragments generated as shown in Figure 1 are prone 
to digestion by exonucleases after transfection into a cell or tissue. Two strategies for 

20 inhibiting this degradation are shown in Figures 3 and 4. In Figure 1, a PNA tail and a 
short DNA sequence (5^GATC-0-TCTCTCTC-3'; SEQ ID NO: 2) is added to the 5' 
end of primers PI and P2 which bind to the target sequence 5'-GAGAGAGA-3' (SEQ 
ID NO: 3). This generates a PNA binding site at the 3' end of the opposite strand 
during PCR. Primers containing PNA tails can be synthesized using methods well 

25 known in the art or are available from PerSeptive Biosystems. These PNA tails do not 
hybridize to a target sequence, but protect the 5 '-ends from exonuclease digestion. In 
order to protect the 5' ends of the PCR fragment, a PNA molecule is added to the 3'- 
PNA tail-protected PCR fragment. This PNA contains a DNA sequence which is 
complementary to the 3 '-end of the PCR fragment and binds thereto as shown in Fig. 

30 3. 



An alternate PNA protection approach is shown in Fig. 4. In this method, a 
PNA clamp is linked to primers PI and P2 via one or more 8-amino-3,6-dioxaoctanoic 
acid linkers to form a PNA "clamp" tail by well known methods. After PGR 
ampUfication, the "clamp tail" binds to the 3' ends of the PGR fragment, and is linked 
5 to the 5' ends via the one or more 8-amino-3,6-dioxaoctanoic acid linkers to protect 
both the 3' and 5' ends of the PGR fragment. 

Although the use of the GMV IE promoter and an artificial mammalian 
transcriptional terminator elements are exemplified herein, the use of any eukaryotic 
promoter and terminator is within the scope of the present invention. Suitable 

10 promoters for use in the present invention include, for example, SV40, rous sarcoma 

virus (RSV), retroviral long terminal repeats (LTR), muscle creatine kinase promoter, 
actin promoter, elongation factor promoter, synthetic promoters, tissue-specific 
promoters, and the like. Suitable terminator sequences include SV40 transcription 
terminator, bovine growth hormone (BGH) terminator, synthetic terminators, and the 

1 5 like. These promoter and terminator sequences can be obtained by restriction enzyme 
digestion of commercially available plasmids and cDNA molecules, or can be 
synthesized using an automated DNA synthesizer using methods well known in the 
art. 

Any desired gene may be amplified and coupled to active promoter and 
20 terminator sequences using the method of the present invention. In a preferred 
embodiment, the gene encodes a gene product which is absent or present at reduced 
levels in an organism. Nonlimiting examples of these gene products are the cystic 
fibrosis transmembrane regulator (GFTR), insuUn, dystrophin, interleukin-2, 
interleukin-12, erythropoietin, gamma interferon, and granulocyte macrophage colony 
25 stimulating factor (GM-GSF). 

Although any transfection method well known in the art may be used to 
transfect the transcriptionally active PGR fragments of the invention into cells or 
tissues including calcium phosphate precipitation, electroporation and DEAE-dextran, 
cationic lipid-mediated transfection is preferred. Gene delivery systems are described 
30 by Feigner et al. (Hum, Gene Ther, 8:511-512, 1997) and include cationic lipid-based 



delivery systems (lipoplex), polycation-based delivery systems (polyplex) and a 
combination thereof (lipopolyplex). Cationic lipids are described in U. S, Patent Nos. 
4,897,355 and 5,459,127, the entire contents of which are hereby incorporated by 
reference. 

5 Example 1 

Production of transcriptionally active PGR fragments 
The following components are combined in a 100 |li1 reaction volume: DNA 
fragment Fl (4 ng) which comprises a region of high transcriptional potency (-240 to 
+60) from the human cytomegalovirus (CMV) immediate early gene (IE) 

10 promoter/enhancer, DNA fragment F2 (4 ng), a 55 base pair oligonucleotide encoding 
an artificial mammalian transcription terminator element (5'- 
CACAAAAAACCAACACACAGATCTCTAGAGCTCTGATCTTTTATTAGCCA 
GAAGT-3'; SEQ ID NO: 4), 400 ng primer PI (5'- 
TCTCTCTACGTATTAGTCATCG-3'; SEQ ID NO: 5), 400 ng primer P2 (5'- 

15 TCACAAAAAACCAACACACAG-3'; SEQ ID NO: 6), 4 ng primer P3 and 4 ng 
primer P4. The PI and P2 primer sequences correspond to the 5' end of fragment Fl 
and the 5' end of fragment F2, respectively (Fig. 1). Primers P3 and P4 are designed 
based on the gene sequence of interest. The 3' potion of these primers (about 10-20 
base pairs) is determined by the actual sequence of the gene to be amplified, whereas 

20 the 5' region of P3 comprises the following sequence: 5'-CTCCGCGGATCCAGA-3' 
(SEQ ID NO: 7) to overlap with the 3' region of Fl, and P4 comprises the sequence 
5'-TTATTAGCCAGAAGT-3' (SEQ ID NO: 8) to overlap with the 3' region of F2. 

PGR is performed as follows: denaturation for 30 seconds at 94°C, anneaUng 
for 45 seconds at 55''C and extension for three minutes at 72°C for 25-30 cycles. The 

25 size of the final produce is verified by 1% agarose gel electrophoresis. The amplified 
PGR fragment is cleaned and purified using a commercial PGR cleaning kit (e.g., 
Qiagen), and can be used for in vitro or in vivo transfection of cells or tissues. 

Example 2 

The green fluorescent protein (GFP, 700 bp) was used as a target gene. The Fl 
30 fragment was a 600 bp fragment of the GMV inmiediate early gene promoter (from - 



550 to +50). A 50 bp oligonucleotide containing a modified rabbit beta-globin gene 
transcription terminator was used as primer 2. Two different versions of these primers 
were designed so that the overlap between Fl and the intermediate GFP PGR 
fragment was either preceded with or without a thymidine base. After the PGR 
5 reaction was carried out according to the conditions described in Example I, the 
products were analyzed by electrophoresis. When a thymidine base preceded the 
overlap region in Fl, a clean major PGR fragment of about 1.4 kb was produced, 
representing the GFP coding region flanked by GMV promoter (Fl) and the 
transcription terminator, whereas if a base other than thymidine preceded the 

10 overlapping region, no product was generated. The fragment generated by nested 

PGR was then transfected into Gos-7 cells and the expression of GFP was monitored 
by fluorescence microscopy. The results showed that the intensity and frequency of 
GFP-expressing cells was almost the same between cells transfected with the PGR 
fragment and a supercoiled plasmid DNA in which the same GMV promoter was 

15 driving expression of the GFP gene. Thus, the present method produces DNA 

fragments which are transcriptionally active. 

While particular embodiments of the invention have been described in detail, it 
will be apparent to those skilled in the art that these embodiments are exemplary 
rather than limiting, and the true scope of the invention is that defined in the following 

20 claims. 
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